For this assignment, I choose the data set of Incarceration Trends Dataset from VERA. In all the data set it provide to me, I choose incarceration_trends.csv as my main source to analyze the unfair of incarceration. For recent year, people are pay much more attention on race equality in different aspects like working, education, or medical condition. For this research, I pay my attention to the situation of incarceration trends to study the specific problems in the system which we considered is fair to everyone as we considered before.
## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.1 ──
## ✔ ggplot2 3.3.6 ✔ purrr 0.3.4
## ✔ tibble 3.1.7 ✔ dplyr 1.0.9
## ✔ tidyr 1.2.0 ✔ stringr 1.4.0
## ✔ readr 2.1.2 ✔ forcats 0.5.1
## ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
## ✖ dplyr::filter() masks stats::filter()
## ✖ dplyr::lag() masks stats::lag()
##
## 载入程辑包:'plotly'
## The following object is masked from 'package:ggplot2':
##
## last_plot
## The following object is masked from 'package:stats':
##
## filter
## The following object is masked from 'package:graphics':
##
## layout
##
## 载入程辑包:'scales'
## The following object is masked from 'package:purrr':
##
## discard
## The following object is masked from 'package:readr':
##
## col_factor
## Rows: 153811 Columns: 121
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr (5): state, county_name, urbanicity, region, division
## dbl (116): yfips, year, fips, total_pop, total_pop_15to64, female_pop_15to64...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## # A tibble: 1 × 6
## mean_jail_rate mean_prison_rate mean_male_rate mean_female_rate
## <dbl> <dbl> <dbl> <dbl>
## 1 4.32 6.69 7.51 1.04
## # … with 2 more variables: mean_white_prison_rate <dbl>,
## # mean_black_prison_rate <dbl>
From the summary information list I get from the dataset above, I found that during all the population, the population in Jail(mean rate is 4.318663%) is less than the population in Prison(mean rate is 6.689772%). When I focus on gender problems which also been mentioned by people more frequently recently, I found that in all population, no matter in Jail or Prison, the average rate of Male(7.514543%) is higher than average rate of Female(1.0408%). After that, I move to the race problems which this assignment most care about, and I found that the average rate of black people(27.47658%) is much higher than the average rate of white people(4.463364%) .
## Rows: 153811 Columns: 121
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr (5): state, county_name, urbanicity, region, division
## dbl (116): yfips, year, fips, total_pop, total_pop_15to64, female_pop_15to64...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
For this time trends chart, I compare how the Black and white Ratio change over time in New York County. The reason Why I choose New York County is because this county is always the central focus of race problem. The data in this county can be a good representation for that period of time.
## Rows: 153811 Columns: 121
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr (5): state, county_name, urbanicity, region, division
## dbl (116): yfips, year, fips, total_pop, total_pop_15to64, female_pop_15to64...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## Warning: Removed 2 row(s) containing missing values (geom_path).
## Removed 2 row(s) containing missing values (geom_path).
For this two variable comparison chart, I compare the general situation of different gender between female and male in Prison. It is easy to find that the size of female population is relative lower than male population is the whole history in New York County.
## Rows: 153811 Columns: 121
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr (5): state, county_name, urbanicity, region, division
## dbl (116): yfips, year, fips, total_pop, total_pop_15to64, female_pop_15to64...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
In this map, I show the average value of Minority Proportion in different state to give a glimpse to the racial situation in the whole America. From this map, we can easily tell how serious the inequity of different is in different state of America.